AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Low-latency speech generation

# Low-latency speech generation

Kimi Audio 7B
MIT
Kimi-Audio is an open-source foundational audio model that excels in audio understanding, generation, and dialogue.
Speech Recognition Supports Multiple Languages
K
moonshotai
55
15
Seamless M4t V2 Large
SeamlessM4T v2 is a large-scale multilingual multimodal machine translation model released by Facebook, supporting speech and text translation for nearly 100 languages.
Text-to-Audio Transformers Supports Multiple Languages
S
facebook
64.59k
821
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase